Searching for sequence features that control DNA flexibility

نویسندگان

چکیده

Modern genomics experiments measure functional behaviors for many thousands of DNA sequences. Using correlation functions between sequences and measured behaviors, we developed a simple physical model interpreting such experimental outputs. Analysis recent high throughput data on mechanics shows that this is highly effective, leading directly to the extraction distinct features flexibility predictions comparable more complex machine learning models. Our approach follows conventional use in statistical physics connects search relevant sequence stimulus sensory neurons.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dna Compressed and Sequence Searching on Multicore

One of the used of string matching is to search DNA sequence in the DNA database. This simple operation can be done in hours or days, because the huge size of DNA sequence database. On the other hand, the potential of multicore for DNA sequence searching is not fully explored due to the difficulty of multicore programming. This paper evaluates several key string matching algorithms using a comp...

متن کامل

Searching for Interacting Features

Feature interaction presents a challenge to feature selection for classification. A feature by itself may have little correlation with the target concept, but when it is combined with some other features, they can be strongly correlated with the target concept. Unintentional removal of these features can result in poor classification performance. Handling feature interaction can be computationa...

متن کامل

Searching for Interacting Features for Spam Filtering

In this paper, we propose a novel feature selection method— INTERACT to select relevant words of emails for spam email filtering, i.e. classifying an email as spam or legitimate. Four traditional feature selection methods in text categorization domain, Information Gain, Gain Ratio, Chi Squared, and ReliefF, are also used for performance comparison. Three classifiers, Support Vector Machine (SVM...

متن کامل

Searching for Features Defined by Hyperplanes

We consider decision tables with real value conditional attributes and we present a method for extraction of features deened by hyperplanes in a multi-dimensional aane space. These new features are often more relevant for object classiication than the features deened by hyperplanes parallel to axes. The method generalizes an approach presented in 18] in case of hyperplanes not necessarily paral...

متن کامل

Searching for Features using a Genetic Algorithm

Automatic classification of words use abstract representations of lexical items. The representations are usually not easily derived from the data available (strings of letters). This is a core problem in nearest neighbor methods. This article describes research towards a genetic algorithm for inventing features of relevance for automatic word classification. The GA attempts to optimize a repres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Biophysical Journal

سال: 2022

ISSN: ['0006-3495', '1542-0086']

DOI: https://doi.org/10.1016/j.bpj.2021.11.554